Method and system for enhanced multi-address read operations in low pin count interfaces

ABSTRACT

A memory device supporting multi-address read operations improves throughput on a bi-directional serial port. The device includes a memory array and an input/output port having an input mode and an output mode. The input/output port has at least one signal line used alternately in both the input and output modes. A controller includes logic configured to execute a multi-address read operation in response to receiving a read command on the input/output port, the multi-address read operation including receiving a first address and a second address using the at least one signal line before outputting data.

PRIORITY APPLICATION

This application is a continuation of co-pending U.S. patent applicationSer. No. 17/070,340 filed on 14 Oct. 2020, which application claims thebenefit of U.S. Provisional Patent Application No. 62/985,900 filed 6Mar. 2020; which applications are incorporated herein by reference.

BACKGROUND Field

The present disclosure relates to read operations on serial port memorydevices, and to such operations having enhanced throughput andefficiencies.

Description of Related Art

Integrated circuit memory devices receive commands, data and addressesfor memory operations, and output data in response on input/outputports. Input/output ports on integrated circuits are often characterizedas serial ports or as parallel ports. For example, commercialimplementations of NOR flash memory are usually characterized asparallel flash or serial flash. Parallel flash is implemented with manyaddress and output pins, for example 48 pins, to provide random memoryaccess with high throughput. Serial flash is often implemented using theSerial Peripheral Interface SPI configuration, having limited numbers ofpins. Of course, other serial input/output port configurations areavailable. In general, serial flash does not perform as well for randomread access as parallel flash.

NOR flash memory, because of the capability of random-access, can beused in execute-in-place (XIP) deployments, where programs are executedfrom long-term nonvolatile storage, without requiring a step of copyingprograms into dynamic RAM. However, throughput can be a limitation onuse of NOR flash in XIP settings, or other applications that can requirehigh-speed random-access. Serial flash may not be able to providesufficient throughput for these kinds of applications. However, it isdesirable to utilize the serial flash products, because of greatlyreduced device pin counts and high-bandwidth capabilities.

Techniques for improving the throughput of serial flash includeincreasing clock frequency, and increasing the number of I/O pins usedin the serial protocol. However, latency involved in internal senseamplifier operations, required for moving data out of the nonvolatilememory into output buffers suitable for outputting the data, does notscale with clock rate and pin count. Thus, this latency can become amajor performance bottleneck for random access serial flash.

It is desirable to provide a technology that can enhance readperformance for low pin count memory products, such as serial NOR flash.

SUMMARY

Technology described herein enables low pin count memory, capable ofreceiving and delivering multiple read commands, addresses and dataoutputs on a bi-directional I/O port, where one or more next readcommands or addresses can be issued/received before changing thebi-directional I/O port from input to output mode to output data from aprevious command or address.

A memory device supporting multi-address read operations as describedherein improves throughput on a bi-directional port. An embodiment ofsuch a device includes a memory array and an input/output port having aninput mode and an output mode. The input/output port has at least onesignal line used alternately in both the input and output modes. Acontroller includes logic configured to execute a multi-address readoperation in response to receiving a read command on the input/outputport in the input mode, the operation including receiving a firstaddress and a second address using the at least one signal line in theinput mode before switching to the output mode, switching to the outputmode and outputting data identified by the first address using the atleast one signal line.

A method for executing a multi-address read operation on a memory deviceincluding a memory array and an input/output port having an input modeand an output mode, wherein the input/output port has at least onesignal line used alternately in both the input and output modes isdescribed. The method includes, in general, receiving a read command onthe input/output port in the input mode in a multi-address readoperation, and in response to the read command, receiving a firstaddress and a second address using the at least one signal line in theinput mode before switching to the output mode; switching to the outputmode; and outputting data identified by the first address using the atleast one signal line.

The technology described herein includes a number of protocols for themulti-address read operation. One protocol is described that includesreceiving two read addresses, switching to output mode after a readlatency for the first address, and outputting the data of the first readaddress, and then having a sequence of interleaved read address, I/Oturn around, and data output phases. Another protocol is described thatincludes receiving a sequence including at least two read addresses,switching to output mode after receiving the sequence of read addressesand expiry of a read latency appropriate for data to be output, andoutputting the data chunks of the sequence of read addresses in anon-interleaved mode. Yet another protocol is described that includesreceiving at least two read commands with read addresses in sequencewithout switching to the output mode, internally moving data identifiedby the read addresses from the memory array to an input/output buffer,and then receiving read-out commands with addresses identifying chunksof data in the input/output buffer, and outputting the data chunks inresponse to the read-out commands.

Technology described herein provides for optimization of throughput formemory devices for low pin count integrated circuits. The technologydescribed herein improves throughput for low pin count NOR flashintegrated circuits, and other type of memory integrated circuits beingdeployed for XIP systems and other systems relying on efficient randomaccess to nonvolatile memory.

Other aspects and advantages of the present technology can be seen onreview of the drawings, the detailed description and the claims, whichfollow.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a is a simplified diagram of an integrated circuit memoryhaving a bi-directional input/output port and enhanced read capabilitiesas described herein.

FIG. 2 illustrates a serial NOR flash having an 8-pin SPI interface, andsupporting enhanced read capabilities as described herein.

FIG. 3 illustrates a prior art read protocol for a serial port having abidirectional I/O port.

FIG. 4 illustrates an enhanced read protocol having interleaved dataoutput phases and read address input phases after an initial data outputphase according to a first embodiment of the technology describedherein.

FIG. 5 illustrates a command decoder and control logic for an enhancedread protocol like that of FIG. 4.

FIGS. 6A and 6B illustrate an enhanced read protocol havingnon-interleaved data output and read address input phases supportingmultiple read commands according to a second embodiment of thetechnology described herein.

FIG. 7 illustrates a command decoder and control logic for an enhancedread protocol like that of FIGS. 6A and 6B.

FIG. 8 is a simplified diagram of an integrated circuit memorypartitioned into multiple banks supporting two-phase enhanced readcapabilities as described herein.

FIG. 9 illustrates a two-phase enhanced read protocol which can beexecuted on a device like that described with reference to FIG. 8.

FIG. 10 illustrates another embodiment of a two-phase enhanced readprotocol which can be executed on a device like that described withreference to FIG. 8.

FIG. 11 illustrates a command decoder and control logic for an enhancedread protocol like that of FIG. 9 without a final read indicator.

FIG. 12 illustrates another embodiment of a two-phase enhanced readprotocol, including burst length and final read indicator parameters.

FIG. 13 illustrates a command decoder and control logic for an enhancedread protocol like that of FIG. 12.

DETAILED DESCRIPTION

A detailed description of embodiments of the present invention isprovided with reference to the FIGS. 1-13.

FIG. 1 is a simplified diagram of a memory device 100 including a memoryarray 101 that comprises a plurality of banks, bank 0, bank 1, bank 2,bank 3 in this example. The memory device 100 includes a bi-directionalI/O port 102 including a data buffer, connected by signal lines to asystem serial I/O bus 110. The I/O port 102 has an input mode and anoutput mode for each bidirectional signal line, including at least onesignal line used in both the input and output modes of a multi-addressread operation as described herein.

The memory device 100 includes a controller 103 with an enhanced readcommand decoder, connected to the I/O port 102 and controls thecomponents of the device to execute read operations in response tocommands and addresses, and output resulting data. In the illustration,the controller 103 is connected to address buffers 104, and to senseamplifiers (including a data register) 107. The address buffers 104connect to a row decoder 105 and column decoder 106. In a readoperation, a command and one or more addresses are received on the I/Oport 102, and decoded by the command decoder in controller 103. Thecontroller 103 executes logic to perform the operation indicated by thecommand, including accessing the memory array 101, causing senseamplifiers to load a data register with the data from the memory array,and transferring data from the data register in the sense amplifiers 107via the I/O port 102 onto the system serial I/O bus 110. The timeinterval between decoding a read command, and receiving output data atthe port 102 to be supplied on the system serial I/O bus 110 is referredto as a read latency which is determined by the performancecharacteristics of the memory array. In some embodiments, configurationparameters for a multi-address read operation are stored inconfiguration registers 111 accessible by the controller 103, and usedin execution of a multi-address read operation as described herein.

In embodiments described herein, a first read command can be received,and the operation begun by the controller 103, and during the readlatency for the first read command, a second read address can bereceived, before changing the I/O port to the output mode and deliveringthe data output in response to the first read command on the systemserial I/O bus 110.

For the purposes of this description the term “bus” refers to acommunication system that transfers data between bus nodes. The I/O port102 on the device 100 is a bus node on the system serial I/O bus 110.The I/O port includes the signal lines (physical layer connectors likewires, pins, contact pads or balls, optical fiber, etc.), andinput/output circuits including drivers on the devices that togetherimplement a communication protocol.

FIG. 2 illustrates an SPI NOR flash package 200 that can supportenhanced read capabilities as described herein. The illustrated packageis an 8-pin package, including pins for power (8) and ground (4). Thesix remaining pins include a chip select CS# pin (1), a serial clockSCLK pin (6), and four pins (2, 3, 5, 7) that can be configured asbidirectional signal lines SI/O0, SI/O1, SI/O2, SI/O3. Each of the fourpins can be configured in other operating modes as well, includingoutput mode signal line SO on pin 2, a control signal WP# on pin 3, acontrol signal HOLD# on pin 7, and an input mode signal line SI on pin5. According to the SPI standard, in a quad mode, for example, all fourof the bidirectional signal lines can be used in an input mode and anoutput mode in multi-address read operations as described herein.

Serial Peripheral Interface SPI devices in dual or quad modes use thesame pins for input of commands and addresses, and output of read datain response to the input commands and addresses. Thus, the interfaceincludes at least one line which has an input mode and an output modeand must change modes between input mode and output mode to complete aread operation executed on the device in response to a single command.The change between input mode and output mode on an I/O pin can bereferred to as a turn-around phase. See, Enhanced Serial PeripheralInterface (eSPI) Interface Base Specification (for Client and ServerPlatforms), Rev. 1, Intel, January 2016.

FIG. 3 illustrates a read protocol on a bidirectional I/O port accordingto the prior art. In this protocol, the chip select signal istransitioned at time 301 and a first read command and address RC1 arereceived. After a read latency L1, the data output in response to thefirst read command and address is provided on the bidirectional I/Oport, and the chip select signal is transitioned at time 302. For a nextread command, the chip select signal is transitioned at time 303, andthe second read command and address RC2 is received. After a readlatency L2, the data output, in response to the second read command andaddresses provided on the bidirectional I/O port and the chip selectsignal, is transitioned at time 304. During the read latency times L1and L2, the bidirectional I/O port can be held in a sequence of dummycycles. This sequence can continue as the host reads data from the chip.However, the read latency times L1 and L2 limit the availablethroughput.

FIG. 4 shows an embodiment of an enhanced read protocol according to thepresent technology. In this protocol, the chip select signal istransitioned at time 401, and a multi-read MR command 402 is receivedfollowed by two read addresses RA1 and RA2 by the bidirectional I/O portin the input mode.

Immediate data in a command and address sequence is defined as avariable or flag received with or as part of the command or address, andapplied in execution of the command. In the embodiment shown in FIG. 4,immediate data including a burst length parameter BL is provided withthe read addresses indicating a length of a data chunk to be output atthe corresponding address in this embodiment. For the purposes of thisdescription, the term “burst” does not imply any particular busoperation but rather is used as a label of the parameter indicatingchunk length, although in some embodiments an entire chunk can be outputin a single burst operation on the bus. In this example, a burst lengthof eight bytes is provided with read address RA1, and a burst length of16 bytes is provided with read address RA2. The burst length values canbe constrained in some embodiments to a certain predetermined number ofoptions. In other embodiments, the burst length values are constrainedonly by the size of the immediate data provided with the read addressused to define the burst length. In other embodiments, the burst lengthis a parameter stored in a configuration register.

After the read latency L1 for the first read address RA1, the I/O portchanges from the input mode to the output mode, and the eight bytes ofdata chunk DO1 from RA1 are output on the I/O port. After finishingoutputting data chunk DO1, the I/O port executes a turnaround at time403, and receives a next read address RA3 with a burst length parameter.Then, the I/O port executes a turnaround at time 404, and the 16 bytesof data chunk DO2 from RA2 are output on the I/O port. Then, the I/Oport executes a turnaround at time 405, and receives a next read addressRA4 with a burst length parameter. Then, the I/O port executes aturnaround at time 406, and the 16 bytes of data chunk DO3 from RA3 areoutput on the I/O port. Then, the I/O port executes a turnaround at time407, and receives a next read address RA5 with a burst length parameter.Then, the I/O port executes a turnaround at time 408, and the 8 bytes ofdata chunk DO4 from RA4 are output on the I/O port. The interleavedRA(x)-data chunk DO(x−1) sequence can continue as needed, receivingaddress RA(x), turning around the I/O port, and outputting data of thepreceding command data chunk DO(x−1) identified by a preceding readaddress RA(x−1) in the sequence, in this example, in which the initialinput mode receives two read addresses, RA1 and RA2. The end of thesequence can be signaled by transition of the chip select signal.

Note that the timing of outputting of data chunk DO3 may not coincidewith the end of the input of RA4, because of the read latency L3requirements.

FIG. 5 is a flowchart illustrating operation of the command decoder andcontroller for the execution of a protocol like that of FIG. 4,receiving two read addresses, switching to output mode of the readlatency for the first address and outputting the data of the first readaddress, and then having a sequence of interleaved read address and dataoutput phases. The operation starts at block 500. The controller waitsfor transition of the chip select signal to a low state for an activelow system (501). Upon transition of the chip select signal, a commandis received (502). The command is decoded and it is determined whetherit is a multiple read command MR (503). If not, it is determined whetheranother valid command is received (504). If another valid command isreceived, then the command is executed (505). After executing thecommand, or if a valid command was not received at block 504, theprocedure waits for transition of the chip select signal 506. Upontransition of the chip select signal, the algorithm returns to block 501to wait for a subsequent transition of the chip select signal.

If at block 503, it is detected that a multiple read command isreceived, then the address and burst length parameters are received onthe I/O port in an input mode, and an internal read operation using, forexample, a read state machine adapted for the particular type of memoryarray, is triggered using the received address (510). The controllerwaits until the internal read operation is completed (511) and the datais available at the I/O port, as indicated by the looping back to block510. Upon completion of the read operation, the controller waits an I/Oswitching delay while the I/O port is turned around (512). After theswitching delay, the data is output from the buffer in the I/O port witha corresponding burst length (513). It is then determined whether thechip select signal has transitioned at step 514. If not, then thecontroller waits for another I/O switching delay (515) turning back tothe input mode. Then, the controller receives a next address andassociated burst length parameter, and triggers an internal readoperation (516). The procedure then returns to block 512, and waits forthe I/O port to turn around, and then outputs data from the buffer withthe corresponding burst length for a preceding read address (513), andthe procedure continues until the chip select signal is transitionedhigh. When it is determined that the chip select signal is high at block514, then the procedure returns to block 501 to wait for transition ofthe chip select signal again to start a new sequence.

In this embodiment, there may be an unnecessary address input before thelast data output. For example, if the system just wanted to output datachunk DO1 to data chunk DO4, the system could set the chip select signalto transition after finishing data chunk DO4. Because the system iswaiting for a new read address input after data chunk DO3, the hostwould be required to issue a dummy read address RA5 before outputtingdata chunk DO4. To address this problem, in one embodiment each readaddress RAx can be received with immediate data in the form of a finalread flag FR. Thus, the final read flag could be clear (0) for RA1 toRA3 and set (1) for RA4. In this case, data chunk DO4 could seamlesslyfollow data chunk DO3.

In some embodiments, the adjacent read addresses RA(x) and RA(x+1) mayhave some constraints. For example, RA1 and RA2 may be required toaddress different banks of the memory array, and RA2 and RA3 may berequired to address different banks of the memory array, while RA1 andRA3 may fall in the same bank.

In another embodiment, each read address may be provided with immediatedata indicating a burst length so each RAx may be associated withdifferent size data chunks.

In another embodiment, the MR command may be received with immediatedata indicating the number of reads NR and burst length BL for themulti-read operation. Also, the burst length may be set by a parameterregister or otherwise pre-configured.

Also, the MR command can carry other parameters as immediate data, suchas output strength parameters used to set a drive strength of drivers inthe I/O port for the operation.

FIGS. 6A and 6B show another embodiment of an enhanced read protocolaccording to the present technology. Referring to FIG. 6A, in thisprotocol, the chip select signal is transitioned at time 600, and amulti-read command MR is received, with immediate data indicating thenumber of read addresses NR to be included in the operation. In thisexample, the first multi-read MR command includes NR equals two. Whilethe I/O port is in the input mode, two read addresses RA1 and RA2 arereceived. The I/O port is in a dummy cycle or waiting state, duringwhich it can change to the output mode. After a read latency L1 for thefirst read address RA1, the data chunk DO1 from the first read addressis output, followed by output of data chunk DO2 from the second readaddress. Data chunk DO2 can be output on the I/O port beginning in aclock cycle immediately following the last bit of data chunk DO1. Afterthe output of the data chunk DO2, the chip select signal is transitionedat time 601. Then, the system waits for chip select transition at time602, followed by, in this example, another multi-read command includinga number of reads NR equal to six.

FIG. 6B picks up the sequence just prior to the transition of the chipselect signal at time 602. After the multi-read command MR, six readaddresses RA3 to RA8 are received in sequence, without exiting the inputmode on the I/O port. Then, the I/O port turns around at time 603. Afterthe turnaround delay, the data chunks DO3 through DO8 are output insequence. After finishing the outputting of data chunk DO8, the chipselect signal transitions again at time 604. In this example, the I/Oport finishes receiving RA8 after expiration of the read latency L3associated with the first read address RA3 associated with this MRcommand. By doing a non-interleaved multiple read command includingissuing a sequence of addresses, followed by contiguous data output fromthe sequence of addresses, the bus transfer efficiency is greatlyimproved. Bus idle time due to waiting for read latency is greatlyreduced or eliminated.

In other examples, having a fewer number of read addresses, for example,there may be a latency required before outputting the data chunk DO3 andthe following chunks of data as the system waits for the read latency L3to complete.

In variations of the embodiment illustrated in FIGS. 6A and 6B, a numberof reads NR parameter can be specified as immediate data with themulti-read command code as discussed above, at the same clock latchingedge or a nearby clock latching edge between the multi-read command inthe first read address. In some embodiments, the read number NR ispreconfigured to a number, such as four, as a parameter in aconfiguration register, and not specified with the multi-read command.In this case, the system is set up to read a specified number ofaddresses and provide a specified number of outputs for each commandcycle.

As discussed above, the read addresses in the sequence may have someconstraints, such as reading from different banks and the like.

In other embodiments, the second multi-read command MR can be issuedwithout transition of the chip select signal. For example, the second MRcommand may be received right after finishing the output data chunk DO2,while chip select remains low. Bus utilization efficiency is furtherimproved by reducing bus idle time while chip select is high. Inembodiments shown in FIG. 6A and 6B, the burst length BL (data length)is pre-configured. In another embodiment, the burst length BL can becarried as immediate data with the multi-read command. Also, in analternative, the burst length can be carried as immediate data with eachread address, so that the size of the output data associated with eachread address can vary.

Furthermore, the multi-read command can also carry other indicators asimmediate data in addition to the number of reads for the burst length.For example, the output strength indicator can be carried to tell thememory how strong the data is to be driven on the system bus.

FIG. 7 is a flowchart illustrating operation of the command decoder andcontroller for the execution of a protocol like that of FIGS. 6A and 6B,receiving two or more read addresses, switching to output mode after theread latency for the first address and outputting the data of the firstread address, and then having a non-interleaved sequence of dataoutputs. The operation starts at block 700. The controller waits fortransition of the chip select signal to a low state for an active lowsystem (701). Upon transition of the chip select signal, a command isreceived (702). The command is decoded and it is determined whether itis a multiple read command MR with a number of reads NR parameter (703).If not, it is determined whether another valid command is received(704). If another valid command is received, then the command isexecuted (705). After executing the command, or if a valid command wasnot received at block 704, the procedure waits for transition of thechip select signal 706. Upon transition of the chip select signal, thealgorithm returns to block 701 to wait for a subsequent transition ofthe chip select signal.

If, at block 703, it is detected that a multiple read command with anumber of reads NR parameter is received, then the address and burstlength parameters are received on the I/O port in an input mode, and aninternal read operation using for example a read state machine adaptedfor the particular type of memory array, is triggered using the receivedaddress (710). The controller waits until the internal read operation iscompleted (711) and the data is available at the I/O port, as indicatedby the looping back to block 710. Upon completion of the read operation,the controller waits an I/O switching delay while the I/O port is turnedaround (712). After the switching delay, the data is output from thebuffer in the I/O port with a corresponding burst length (713). Next, itis determined whether the number of reads parameter NR has been reached(714). If not, then the algorithm loops to output the next data chunkfrom the buffer with the corresponding burst length at block 713. If thenumber of reads parameter has been reached at block 714, then thecontroller determines whether the chip select signal is high. If not,then the controller waits the I/O switching delay at block 716, andreturns to block 702 to wait for the next command. If at block 715 thechip select signal is high, then the algorithm loops back to block 701to wait for the transition of the chip select signal.

FIG. 8 illustrates aspects of a memory chip architecture for anintegrated circuit memory 800 suitable for enhanced read operations asdescribed herein. The memory is partitioned in this example into fourbanks BA1 to BA4. Each bank (e.g. bank BA4) includes an array 811, 812,813, 814, coupled with peripheral circuits. Referring to bank 814, theperipheral circuits include a row decoder 821, and a column decoder 825.An input output I/O buffer 850 associated with the I/O port is coupledto the system bus 801. The circuits receive bank addresses which selecta bank, and provide page addresses to a page address register (e.g.822), and byte addresses to a byte address register (e.g. 823). The pageaddress is used to select one of the word lines in the array, and senseresults at the sense amplifier and read page buffer (e.g. 826). The byteaddress operates the byte multiplexer 827, which selects the bytes inthe read page buffer to be sent to the I/O buffer 850. The I/O buffermay be used by an SPI port or other I/O port configuration, to outputthe read data on one or more bi-directional lines. This type ofarchitecture supports enhanced read operations using two types ofcommands, a read command RCx address to a bank and page in the memoryarray, and a read-out command ROx addressed to the I/O buffer 850, andlinked to a read command by at least a bank address.

FIG. 9 illustrates an enhanced read protocol suitable for a device likethat of FIG. 8 using two types of commands, a multiple, or “continuous”,read command RCx and a readout command ROx. The read command RCxincludes a command instruction code, a bank address and the page addressto be accessed. After receiving the read command, the decoder and senseamplifier on the selected bank are activated to do the operations toread data from the array which takes a read latency L1 to the buffer. Areadout command ROx includes a command instruction code, a bank addressand the byte address to be accessed. After receiving the readoutcommand, the data in the read page buffer of the specified bank is sentto the byte multiplexer and to the I/O buffer on the serial port. Themove from the page buffer to the I/O buffer is a much shorter latencythan that from the array to the page buffer. The data chunk DOx isoutput to the system bus in response to the readout command ROx. Theread command and read-out command are associated if their bank addressesare the same. For example, if a read command RC1 and read-out commandsRO1 and RO3 all specify the same bank, then the readout commands RO1 andRO3 can be issued after the latency L1 associated with the read commandRC1.

In the illustrated example, on transition of the chip select signal attime 901, a first read command RC1 is received with a bank address BKm,where m can be any one of banks one through four in the example of FIG.8. After completion of the read command and address, the chip selectsignal transitions at time 902 and the system waits for the nexttransition of the chip select signal. In this example, the chip selectsignal transitions at time 903 to receive a second read command RC2. Thesecond read command RC2 including bank address BKn is received beforethe latency L1 associated with the first read command RC1 is expired. Inthis example, after completion of receiving the second read command RC2,the chip select signal transitions again at time 904. At time 905, thechip select signal transitions to begin input mode on the bidirectionalI/O port to receive the third read command RC3 including bank addressBKp. After completion of receiving the third read command RC3, the chipselect signal transitions again (906), and the system waits for the chipselect signal to transition low at time 907. At time 907, the readlatency L1 for the first read command is expired, and the systemreceives the first readout command RO1 including bank address BKm,matching the bank address of the first read command RC1. On completionof receiving the first readout command, data in the page read buffer ismoved to the I/O buffer during latency 908, and the I/O bus switchesfrom the input mode to the output mode. After latency 908, the datachunk DO1 responsive to the first read command RC1 is output.

Upon completion of the outputting of the data chunk D01, the chip selectsignal transitions again at time 909, and then the system waits for itto transition at time 910. At time 910, the latency L2 associated withthe second read command RC2, and the latency L3 associated with thethird read command RC3 are both completed. Thus, the system receives,and can process, a second readout command RO2 with bank address BKp,which matches the bank address of the third read command RC3. Afterreceiving the second readout command, and the latency 911 for moving thedata to the I/O buffer, the data chunk DO2 responsive to the third readcommand RC3 is output.

The chip select signal transitions again at time 912, and then thesystem waits for it to transition at time 913. Then, the system receivesa fourth read command RC4 with bank address BKp. After receiving thefourth read command RC4, the chip select signal transitions high at time914.

After transition low at time 915, the latency L1, L2 and L3 associatedwith the first, second and third read commands RC1, RC2, and RC3 arecompleted. Thus, the system receives, and can process a third readoutcommand RO3 with any of the bank addresses BKp, BKn or BKp, whichmatches the bank address of one of the read commands RC1, RC2 or RC3.After receiving the third readout command, in the example a bank addressmatching that of the first read command RC1, and the latency 916 formoving the data to the I/O buffer, the data chunk DO3 responsive to thefirst read command RC1 is output. Thereafter, the chip select signaltransitions at time 917. The process can continue as necessary.

FIG. 10 illustrates an alternative flow using multiple, or “continuous”,read commands and read-out commands as discussed above in connectionwith FIG. 9. In the embodiment of FIG. 10, commands can be received attimes other than the transition of the chip select signal, includingafter finishing of a preceding read command, and after finishing of apreceding data output, with any necessary output-to-input switchinglatency required to turn around the bidirectional I/O port. Thus, asillustrated at time 1001, first second and third read commands RC1, RC2and RC3 are received without intervening switching on the chip selectline. In some embodiments the first bit of RC(x) is receivedcontiguously in the next clock cycle after the last bit of the previousRC(x−1). Upon completion of receiving RC3, the chip select signaltransitions at times 1002 and 1003. After transition at time 1003, anext read command RC4 is received followed contiguously by a firstread-out command RO1 after completion of the latency L1. The firstread-out command RO1 includes the bank address BKm matching that of thefirst read command RC1. This causes the data moved from the page bufferto the I/O buffer with the read-out latency 1004, followed by the datachunk DO1. After completion of outputting the data chunk DO1, the I/Obus executes a turnaround, with an output to input switching latency1005, and thereafter a second read-out command RO2 is received, withoutan intervening transition of the chip select signal. In this example,RO2 as the same bank address as the third read command RC3 and isreceived after completion of the latency L3. Thus, after the read-outlatency time 1006, the second data chunk DO2 from bank BKp is output onthe system bus. In this example, the I/O bus turns around at time 1007,and a fifth read command RCS is received followed by a third read-outcommand RO3. In this example, the third read-out command is the bankaddress of the first read command RC1, and after the read-out latency1008, the third data chunk DO3 is output. In this example, the chipselect signal transitions at time 1009. At time 1010, the chip selectsignal transitions again and a fourth read-out command RO4 is received.After the read-out latency 1011, the fourth data chunk DO4 is output.Finally, the chip select signal transitions at time 1012 to completethis operation.

The embodiment shown in FIG. 10 can improve I/O bus utilization byreducing the time period required by the chip select high to lowtransitions.

In some embodiments, the read-out commands can carry a burst lengthindicators BL to specify the data link to the corresponding data output.For example, RO2 might indicate a burst length of 32 bytes, while theread-out commands RO1, RO3 and RO4 indicate a burst length of 16 bytes.

FIG. 11 is a flowchart illustrating logic executed by a command decoderand controller for protocol like that of FIG. 10. The operation startsat block 1100. The controller waits for transition of the chip selectsignal to a low state for an active low system (1101). Upon transitionof the chip select signal, a command is received (1102). The command isdecoded and it is determined whether it is a continuous read command RCx(1103). If not, it is determined whether the command is a read-outcommand ROx (1104). If not, it is determined whether another validcommand is received (1105). If another valid command is received, thenthe command is executed (1106). After executing the command, or if avalid command was not received at block 1106, the procedure waits fortransition of the chip select signal 1107. Upon transition of the chipselect signal, the algorithm returns to block 1101 to wait for asubsequent transition of the chip select signal.

If at block 1103, it is detected that a read command is received, thenan internal read operation using for example a read state machineadapted for the particular type of memory array, is triggered using thebank and page address to move the page to the page buffer (1110). Thenthe procedure determines whether the chip select signal is high (1111),and if so, then it loops to block 1101 to wait for chip select totransition low, and waits for a next command cycle. If at block 1111,the chip select signal remains low, then the procedure loops to block1102 to receive a next command in the multi-address read operation onthe system I/O bus.

If at block 1104, a read-out command is detected, then the logic waitsthe I/O switching delay to turn around the I/O port (1115), and outputsthe ROx data chunk DOx from the page buffer to the I/O buffer, with thecorresponding burst length (1116). If at this time, the read operationmoving the data of the first address in the address sequence to thebuffer is not complete, then the read out operation waits until the datais available in the buffer. Then the procedure determines the state ofthe chip select signal (1117). If chip select is high, then theprocedure loops to block 1101. If chip select remains low at block 1117,then the I/O bus turns around, and after waiting the I/O switching delay(1118), the procedure loops to block 1102 to receive a next command.

A similar logic is applied for the protocol of FIG. 9, modified (e.g. atblock 1111) to require a chip select transition before receiving a nextcommand.

FIG. 12 illustrates yet another protocol including continuous readcommands and read-out commands, with the addition of fewer chip selecttransitions, along with burst length parameters and final readindicators. The protocol illustrated, the chip select signal transitionsat time 1200, and three read commands RC1, RC2 and RC3 are receivedcontiguously until the chip select signal transitions at time 1201. Attime 1202, the chip select signal transitions low again, and a fourthread command RC4 is received followed by two contiguous read-outcommands RO1 and RO2. The first read-out command RO1 includes a burstlength parameter of eight bytes, and a final read flag equal to zero.The second read-out command RO2 includes a burst length parameter equalto 16 bytes, and a final read flag equal to one. This indicates that itis the last read command in the sequence. Thus the controller turnsaround the I/O bus, and moves data chunk DO1 from bank BKm of the firstread-out command RO1 and the first read command RC1, to the I/O buffer,and after latency 1203 for the data movement and turn around, outputsthe first data chunk DO1, having a length of eight bytes, followed bythe second data chunk DO2 having a length of 16 bytes from the bank BKp(corresponding to the third read command RC3). After a turn-aroundlatency 1204, the I/O bus turns around, and a next read-out command RO3is received followed by a next read command RCS and a fourth read-outcommand RO4 in sequence. The fourth read-out command RO4 has the finalread flag FR set to one, so the system moves data from bank BKmindicated in the third readout command RO3 to the I/O buffer, and afterlatency 1205 for the data movement and turn around, outputs the thirddata chunk DO3 with a burst length of 16 bytes, followed by the fourthdata chunk DO4 with a burst length of eight bytes. In this example, thechip select signal transitions at time 1206 back to a high state.

In this protocol, the memory outputs data after receiving a read-outcommand with the final read flag equal to one such as in the read-outcommands RO2 and RO4 in the example. The memory outputs datasequentially in response to the previous read-out command (data chunkDO1 and data chunk DO2). As illustrated, multiple read commands as wellas the corresponding data chunks can be issued in a group manner. Thusthe read-out latency and output to input switching latency can befurther offset. The bus transfer efficiency is therefore improved evenfurther for this embodiment.

In the illustrated embodiments, the burst length parameter is carriedwith the read-out command. In other embodiments, the burst length can bepreconfigured or carried with the corresponding read command. In somesequences, the short latency required between each read-out command andits corresponding output data may be encountered. However, this read-outlatency is much shorter than the read latency required to move the datafrom the memory array into the page buffer.

In other embodiments, the read command can include other indicators inaddition to the page address. For example, the read command may carry asimmediate data the required number of clock signals needed to satisfythe read latency. Also, the read-out command can carry as immediate dataother indicators in addition to the by address, the burst length and thefinal read flag. For example, the read-out command may include asimmediate data the required latency clock count to be waited for movingthe data from the page buffer to the I/O buffer.

FIG. 13 is a flowchart illustrating command decoder and control logicwhich can be used with the system implementing an embodiment like thatof FIG. 12. The operation starts at block 1300. The controller waits fortransition of the chip select signal to a low state for an active lowsystem (1301). Upon transition of the chip select signal, a command isreceived (1302). The command is decoded and it is determined whether itis a continuous read command RCx (1303). If not, it is determinedwhether the command is a read-out command ROx with an FR flag (1304). Ifnot, it is determined whether another valid command is received (1305).If another valid command is received, then the command is executed(1306). After executing the command, or if a valid command was notreceived at block 1306, the procedure waits for transition of the chipselect signal 1307. Upon transition of the chip select signal, thealgorithm returns to block 1301 to wait for a subsequent transition ofthe chip select signal.

If at block 1303, it is detected that a read command is received, thenan internal read operation using for example a read state machineadapted for the particular type of memory array, is triggered using thebank and page address to move the page to the page buffer (1310). Thenthe procedure determines whether the chip select signal is high (1311),and if so, then it loops to block 1301 to wait for chip select totransition low, and waits for a next command cycle. If at block 1311,the chip select signal remains low, then the procedure loops to block1302 to receive a next command on the system I/O bus.

If at clock 1304, a read-out command is detected, then the system checksthe final read flag (1315). If the final read flag is clear (equal tozero), then the read-out command is pushed into a command queue foroutput after the final read flag is set (1316). Next, the algorithmdetermines whether the chip select signal is high (1311) and proceedsaccordingly. If at block 1315, the FR flag is set (equal to one), thenafter waiting for the I/O switching delay (1317), the controller outputsthe data chunks DOx from the ROx commands in the queue (1318). Afteroutputting the sequence of data chunks data, the algorithm determineswhether the chip select signal is high (1319). If not, then afterwaiting the I/O switching delay 1320, the algorithm loops back to block1302 to receive a next command. If the chip select signal is not high atblock 1319, in the algorithm loops back to block 1301 to wait fortransition of the chip select signal. This cycle continues as necessaryto complete the read operation.

A number of flowcharts illustrating logic executed by a memorycontroller or by memory device are described herein. In the flow charts,the chip select signal is relied upon as the control signal indicatingstarting and stopping of operations. In some embodiments, other controlsignals can replace the chip select signal, or be used in combinationwith the chip select signal for this purpose.

The logic illustrated in the flow charts can be implemented usingprocessors programmed using computer programs stored in memoryaccessible to the computer systems and executable by the processors, bydedicated logic hardware, including field programmable integratedcircuits, and by combinations of dedicated logic hardware and computerprograms. With all flowcharts herein, it will be appreciated that manyof the steps can be combined, performed in parallel or performed in adifferent sequence without affecting the functions achieved. In somecases, as the reader will appreciate, a rearrangement of steps willachieve the same results only if certain other changes are made as well.In other cases, as the reader will appreciate, a re-arrangement of stepswill achieve the same results only if certain conditions are satisfied.Furthermore, it will be appreciated that the flow charts herein showonly steps that are pertinent to an understanding of the invention, andit will be understood that numerous additional steps for accomplishingother functions can be performed before, after and between those shown.

Thus, the memory device is described roughly illustrated in FIG. 1,having a bidirectional I/O interface where command address and data aretransferred through the same I/O bus bidirectionally, such that in orderto change from an input mode to an output mode, at least one signal lineon the bus has to change directions. The system is capable of executingmultiple read operations in an overlapping manner, after receiving twoor more of a first type of read command, before executing a second typeof read command to output the data. In the illustrated system, secondread command can be received after receiving a first read command, andbefore the read-out command corresponding to the first read command isexecuted. The first read command in the first read-out command areassociated because they specify the same bank address in someembodiments. The read commands can include a command code, a bankaddress and a page address for page mode memories. The read-out commandcan include a bank address and a bite address to identify the dataoutput chunks associated with the read-out command. In some embodiments,sequential read commands are constrained address different banks in amemory device.

Also, the memory can receive a read command after a chip select signaltransitions. Also, the memory can receive a read command after finishingreceiving a preceding read command. Also, the memory can receive a readcommand right after finishing a data output of the specified length,using a data link that is preconfigured in some embodiments, or carriedas media data with a read-out command in other embodiments.

The technologies described providing low pin count memory capable ofreceiving and delivering multiple read commands, addresses or dataoutputs on a bidirectional I/O port, wherein one or more next readcommands (the second read, the third read . . . , RA2/RA3, or RC2/RC3)can be issued or received before the output (first output data chunkDO1) that corresponds to the first read command and address.

A number of embodiments are described in which there is at least one setof bidirectional I/O ports, or signal lines, for executing a readcommand/read address/data output sequence. Thus, low pin count memory isprovided capable of receiving and delivering multiple read commands,addresses and data outputs on a bi-directional I/O port with improvedefficiency and throughput. The protocols described enable read sequencesthat make efficient use of the read latency between read commands andthe output of corresponding data to provide improved throughput and busefficiency.

While the present invention is disclosed by reference to the preferredembodiments and examples detailed above, it is to be understood thatthese examples are intended in an illustrative rather than in a limitingsense. It is contemplated that modifications and combinations willreadily occur to those skilled in the art, which modifications andcombinations will be within the spirit of the invention and the scope ofthe following claims.

What is claimed is:
 1. A memory device, comprising: a memory array andan input/output port having an input mode and an output mode, whereinthe input/output port has at least one signal line used alternately inboth the input and output modes; and a controller including logicconfigured to execute a multi-address read operation in response toreceiving a read command on the input/output port, the multi-addressread operation including receiving a first address and a second addressusing the at least one signal line before outputting data.
 2. The memorydevice of claim 1, wherein the multi-address read operation includes:outputting a first data chunk on the input/output port after a firstlatency, in response to receiving the first address identifying thefirst data chunk; before expiration of the first latency, receiving thesecond address identifying a second data chunk; and outputting thesecond data chunk on the input/output port after a second latency. 3.The memory device of claim 1, wherein the read command indicatesbeginning of the multi-address read operation.
 4. The memory device ofclaim 2, wherein the first and second data chunks have differentlengths.
 5. The memory device of claim 1, wherein the read command isreceived with data indicating a number of addresses to be receivedbefore switching to the output mode in the multi-address read operation.6. The memory device of claim 1, wherein the multi-address readoperation includes receiving the read command upon transition of acontrol signal, and receiving the first address and the second addresswithout an intervening transition of the control signal.
 7. The memorydevice of claim 1, wherein the multi-address read operation includesafter outputting a first data chunk identified by the first address,switching to the input mode and receiving a sequence of read addressesfor subsequent data chunks interleaved with switching to the output modeand outputting data chunks identified by preceding read addresses in thesequence of read addresses.
 8. The memory device of claim 1, wherein themulti-address read operation includes receiving a plurality of readaddresses, the plurality of read addresses including the first andsecond addresses, and a read address in the plurality of read addressesfor a last data chunk in the multi-address read operation is receivedwith a final read indicator as immediate data, and the multi-addressread operation includes outputting the last data chunk after outputtinga data chunk identified by an immediately preceding address withoutchanging the input/output port from output mode to input mode forintervening input.
 9. The memory device of claim 1, wherein themulti-address read operation includes receiving a plurality of readaddresses, the plurality of read addresses including the first andsecond addresses, and one or more of the read addresses in themulti-address read operation are received with burst length indicatorsas immediate data, indicating a length of data chunks identified by theread addresses.
 10. The memory device of claim 1, wherein one or moreread addresses is received with immediate data identifying a settingapplied in the multi-address read operation.
 11. The memory device ofclaim 2, wherein the multi-address read operation includes, beforeoutputting the second data chunk, and after outputting the first datachunk, a third address identifying a third data chunk, and outputtingthe third data chunk on the input/output port after a third latency. 12.The memory device of claim 1, wherein the multi-address read operationincludes receiving a number of read addresses in sequence in themulti-address read operation on the input/output port withoutintervening output on the input/output port, and after receiving thenumber of read addresses, changing the input/output port to the outputmode and outputting data chunks identified by the received number ofread addresses in sequence without intervening input on the input/outputport.
 13. The memory device of claim 12, wherein the read command isreceived with immediate data indicating the number of read addresses.14. The memory device of claim 12, wherein the number of read addressesis set by a stored configuration parameter.
 15. The memory device ofclaim 12, wherein the read addresses are received with immediate dataindicating a chunk length for data chunks identified by the readaddresses.
 16. The memory device of claim 1, wherein the multi-addressread operation includes, before outputting a first data chunk identifiedby the first address, moving the first data chunk identified by thefirst address to a buffer, receiving a read-out command associated withthe first address, and after the first data chunk identified by thefirst address is available in the buffer, outputting the first datachunk identified by the first address in response to the read-outcommand.
 17. The memory device of claim 1, wherein the memory arrayincludes a plurality of banks, and the first and second addressesinclude a bank address of a first data segment and a second datasegment, respectively, and wherein the multi-address read operationincludes, before outputting a first data chunk identified by the firstaddress, receiving a read-out command including the bank address andidentifying the first data segment, and outputting the first data chunkof the first data segment in response to the read-out command.
 18. Thememory device of claim 17, wherein the multi-address read operationincludes receiving the read command and the first address between afirst pair of transitions of a control signal, and receiving the secondaddress between a second pair of transitions of the control signal. 19.The memory device of claim 17, wherein the multi-address read operationincludes receiving the read command and the first address between afirst pair of transitions of a control signal, and receiving theread-out command and outputting the first data chunk between asubsequent pair of transitions of the control signal.
 20. The memorydevice of claim 17, wherein the read command includes an instructioncode, and an address identifying a bank and a page of the first datasegment, and the read-out command includes an instruction code, and anaddress identifying the bank and a byte address of the first data chunkof the first data segment to be output.
 21. A method for executing amulti-address read operation on a memory device including a memory arrayand an input/output port having an input mode and an output mode,wherein the input/output port has at least one signal line usedalternately in both the input and output modes, the method comprising:receiving a read command on the input/output port in a multi-addressread operation, and in response to the read command, receiving a firstaddress and a second address using the at least one signal line beforeoutputting data; and outputting data using the at least one signal line.22. The method of claim 21, including: outputting a first data chunk onthe input/output port after a first latency, in response to receivingthe first address identifying the first data chunk; before expiration ofthe first latency, receiving the second address identifying a seconddata chunk; and outputting the second data chunk on the input/outputport after a second latency.
 23. The method of claim 21, wherein theread command indicates beginning of the multi-address read operation.24. The method of claim 22, wherein the first and second data chunkshave different lengths.
 25. The method of claim 21, wherein the readcommand is received with data indicating a number of addresses to bereceived before switching to the output mode in the multi-address readoperation.
 26. The method of claim 21, including receiving the readcommand upon transition of a control signal, and receiving the firstaddress and the second address without an intervening transition of thecontrol signal.
 27. The method of claim 21, including, after outputtinga first data chunk identified by the first address, switching to theinput mode and receiving a sequence of read addresses for subsequentdata chunks interleaved with switching to the output mode and outputtingdata chunks identified by preceding read addresses in the sequence ofread addresses.
 28. The method of claim 21, wherein the multi-addressread operation includes receiving a plurality of read addresses, theplurality of read addresses including the first and second addresses,and a read address for a last data chunk in the multi-address readoperation includes a final read indicator as immediate data, and themulti-address read operation includes outputting the last data chunkafter outputting a data chunk identified by an immediately precedingaddress without changing the input/output port from output mode to inputmode for intervening input.
 29. The method of claim 21, wherein themulti-address read operation includes receiving a plurality of readaddresses, the plurality of read addresses including the first andsecond addresses, and one or more of the read addresses in themulti-address read operation are received with burst length indicatorsas immediate data, indicating a length of data chunks identified by theread addresses.
 30. The method of claim 21, wherein one or more readaddresses is received with data identifying a setting applied in themulti-address read operation.
 31. The method of claim 22, including,before outputting the second data chunk, and after outputting the firstdata chunk, receiving a third address identifying a third data chunk,and outputting the third data chunk on the input/output port after athird latency.
 32. The method of claim 21, including receiving a numberof read addresses in sequence in the multi-address read operation on theinput/output port without intervening output on the input/output port,and after receiving the number of read addresses, changing theinput/output port to the output mode and outputting data chunksidentified by the received number of read addresses in sequence withoutintervening input on the input/output port.
 33. The method of claim 32,wherein the read command is received with data indicating the number ofread addresses.
 34. The method of claim 32, wherein the number of readaddresses is set by a stored configuration parameter.
 35. The method ofclaim 32, wherein the read addresses are received with data indicating achunk length for a data chunk identified by one of the read addresses.36. The method of claim 21, including, before outputting a first datachunk identified by the first address, moving the first data chunkidentified by the first address to a buffer, receiving a read-outcommand associated with the first address, and after the first datachunk identified by the first address is available in the buffer,outputting the first data chunk identified by the first address inresponse to the read-out command.
 37. The method of claim 21, whereinthe memory array includes a plurality of banks, and the first and secondaddresses include a bank address of a first data segment and a seconddata segment, respectively, and the method further includes, beforeoutputting a first data chunk identified by the first address, receivinga read-out command including the bank address and identifying the firstdata segment, and outputting the first data chunk of the first datasegment in response to the read-out command.
 38. The method of claim 37,including receiving the read command and the first address between afirst pair of transitions of a control signal, and receiving the secondaddress between a second pair of transitions of the control signal. 39.The method of claim 37, including receiving the read command and thefirst address between a first pair of transitions of a control signal,and receiving the read-out command and outputting the first data chunkbetween a subsequent pair of transitions of the control signal.
 40. Themethod of claim 37, wherein the read command includes an instructioncode, and an address identifying a bank and a page of the first datasegment, and the read-out command includes an instruction code, and anaddress identifying the bank and a byte address of the first data chunkof the first data segment to be output.